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BACKGROUND OF THE INVENTION 



18 L 



Field of the Invention 



20 



This invention relates to symmetric multiprocessor synchronization and im- 



21 plicit synchronization of resources using migrating scheduling domains, as described 



22 herein. 
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1 2. Related Art 
2 

3 In computer systems having multiple processors with concurrent execution, 

4 it is desirable to use as much of the parallelism as possible from the multiple processors. 

5 One problem with using the parallelism of multiple processors is that of designing soft- 

6 ware to make use of that parallelism. For example, software that was designed for use 

7 with a uniprocessor system often does not exploit the parallelism of a multiprocessor 

8 system to the fullest extent possible. 

9 

Jp A first known method is to redesign or rewrite software originally designed 

[lh for use with a uniprocessor system, so as to make use of the advantages of a multiproces- 
sor system. While this known method does generally achieve the goal of using the ad- 

,13 vantages of a multiprocessor system, it is subject to several drawbacks. First, it is ex- 

#4 tremely expensive, in that it uses relatively large amounts of (human design and coding) 

'H4t<<VFl 

!ffe resources for redesigning or rewriting program code. Second, it is sometimes then neces- 

16 sary to maintain two different code bases, one for uniprocessor systems and one for mul- 

17 tiprocessor systems, also resulting in additional expense and use of human design and 

18 coding resources. 

19 

20 A second known method is to introduce (into software originally designed 

21 for use with a uniprocessor system) those explicit synchronization methods for maintain- 

22 ing integrity of resources to be shared among multiple processors. While this known 
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1 method generally achieves the same goal with relatively less expense and consumption of 

2 resources than a complete redesign or rewrite of the software code base, it also suffers 

3 from several drawbacks. First, it introduces a relatively large amount of new code subject 

4 to possible error in coding. Second, it introduces additional processor and memory usage 

5 to implement known explicit synchronization methods (such as locking mechanisms), 

6 with resulting slowing of the system using those known explicit synchronization methods. 

7 The second drawback is particularly exacerbated for resources that are primarily used by 

8 only one software element, but find occasional use by a second software element; the first 

9 software element pays the price of known explicit synchronization methods for each use 
|o of the resource, even though contention for that resource might be relatively rare. Moreo- 



jh ver, these drawbacks for this second known method are also applicable to the first known 
: |2 method, as a new design would likely employ explicit synchronization methods. 

13 



jj4 A third known method is to identify (within software originally designed for 

S|5 use with a uniprocessor system) those functional elements that can each independently 



16 operate without using known explicit synchronization methods. An example of this third 

17 known method is shown in U.S. Patent 5,485,579; in that patent, each separated fiinc- 

18 tional element is bound to a single processor in a multiprocessor system, so that the sys- 

19 tern can assure that each processor is performing functions that do not require known ex- 

20 plicit synchronization methods. While this method generally achieves the goal of using 

21 the advantages of a multiprocessor system, it is subject to several drawbacks. First, the 

22 mapping between separated functional elements and processors is 1 : 1 , so if the number of 
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1 separated functional elements differs from the number of processors, the system will ei- 

2 ther underutilize at least some of the processors or underperform the functions of at least 

3 some of the separated functional elements. Second, there is no provision for load balanc- 

4 ing among the multiple processors. Third, there is no useful technique for altering the 

5 code base so as to make use of greater parallelism, without resorting to the first known 

6 method described above. 

7 

8 Accordingly, it would be advantageous to provide a technique for schedul- 

9 ing a set of tasks in a multiprocessor system that is not subject to drawbacks of the known 
rffp art. In a preferred embodiment, this is achieved using a method and system for providing 

3 ;i j: :! i: 
^ s 

[¥ 1 parallel execution of those tasks while implicitly synchronizing access to a set of re- 

\j2 sources (such as data structures or hardware devices) used by that system. 

§4 SUMMARY OF THE INVENTION 

as 

" 16 The invention provides a method and system for scheduling a set of tasks in 

17 an MP (multiprocessor) system, and provides parallel execution of those tasks while im- 

18 plicitly synchronizing access to a set of resources (such as data structures or hardware de- 

19 vices) used by that system. Tasks in the MP system are each assigned to a scheduling do- 

20 main, thus associating those tasks with a set of resources controlled by that domain. A 

21 scheduler operating at each processor in the MP system implicitly synchronizes those re- 

22 sources controlled by each domain, by scheduling only one task for each domain to exe- 
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1 cute concurrently in the system. Because each instance of the scheduler selects which task 

2 is next run independently of its processor, each domain can migrate from one processor to 

3 another; thus, each domain can have a task executing on any processor, so long as no do- 

4 main has two tasks executing concurrently in the system. Thus, domains (and their tasks) 

5 are not bound to any particular processor. Hence the method and system are symmetric. 

6 

7 A preferred embodiment uses the implicit synchronization enforced by the 

8 scheduler for resources controlled by a single domain, and performs explicit synchroniza- 

9 tion only for resources shared by more than one domain. When a resource is needed by a 
#) task in a first domain but controlled by a second domain, the task can re-designate itself 
Lti (and thus switch) from the first to the second domain; this allows execution by other tasks 
IJa in the first domain, while preserving domain scheduling heuristics. This technique pro- 
si3 vides for implicit synchronization of resources controlled by the first domain, so that ex- 
jjj4 plicit synchronization is not needed. 



f45 

16 A preferred embodiment can designate a set of tasks known to be MP-safe 

17 (safe for use in an MP system) to not be assigned to any particular domain, as those tasks 

18 can be executed concurrently with all other tasks. MP-safe tasks can include: (a) tasks 

19 that do not use any resources controlled by a particular domain, such as tasks that perform 

20 only computation and/or keep only their own data structures; (b) tasks that already use 

21 explicit synchronization for resources they need; and (c) tasks that are otherwise deter- 

22 mined by programmers to not require explicit synchronization. 
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Those of ordinary skill in the art will recognize, after perusal of this appli- 
cation, the many advantages provided by the invention. These include, but are not limited 
to, the following: 

• The invention provides for MP-safe operation of applications originally designed 
for a UP system without having to substantially rewrite those applications. 

• The invention provides for MP-safe operation of applications having distinguish- 
able functional elements, without having to incur overhead associated with explicit 
synchronization. 

• The invention provides for incrementally altering program code originally de- 
signed for a UP (uniprocesor) system to execute in an MP-safe manner on an MP 
system. 

• The invention provides for increased parallel use of an MP system without having 
to assign any functional elements to particular processors. This is particularly valu- 
able where the number of functional elements are greater than the number of par- 
ticular processors. 



• The invention provides for improved load balancing among the processors in an 
MP system. 
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1 The invention has general applicability to applications of any kind execut- 



2 ing in an MP system in which the scheduler provides implicit synchronization using do- 



3 mains. Although a preferred embodiment is described with regard to a file server, there is 



4 no particular limitation of the invention to file servers or similar devices. Techniques used 



5 by a preferred embodiment of the invention for implicit synchronization, domain migra- 



6 tion, domain switching, and the like can be used in contexts other than the specific appli- 



7 cations disclosed herein. 



8 



9 BRIEF DESCRIPTION OF THE DRAWINGS 



Jji Figure 1 shows a conceptual diagram of a multiprocessor system including 

** ** ■ v 

fcfc a set of migrating scheduling domains. 



s13 



fk Figure 2 shows a diagram of scheduling domains, the tasks and resources 

S(5 that reside in them, and the resource access methods. 

16 



17 Figure 3 shows a conceptual diagram of symmetric scheduling using a set 

18 of migrating scheduling domains. 

19 

20 Figure 4 shows a conceptual diagram of symmetric scheduling in which 

21 tasks can switch scheduling domains. 



22 
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1 Figure 5 shows a process flow diagram of a symmetric scheduler using a set 

2 of migrating scheduling domains. 
3 

4 Figure 6 shows a conceptual diagram of explicit synchronization and im- 

5 plicit synchronization. 

6 

7 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

8 

9 In the following description, a preferred embodiment of the invention is de- 

#) scribed with regard to preferred process steps and data structures. Those skilled in the art 

y|i would recognize after perusal of this application that embodiments of the invention can 

%12 be implemented using one or more general purpose processors or special purpose proces- 

-13 sors or other circuits adapted to particular process steps and data structures described 

214 herein, and that implementation of the process steps and data structures described herein 

P 45 would not require undue experimentation or further invention. 

■* 

16 

17 Related Information 
18 

19 Inventions described herein can be used in conjunction with inventions de- 

20 scribed in the following application(s): 

21 
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• Application Serial No. , Express Mail Mailing No. 

, filed the same day, in the name of inventors Christopher 

PEAK, Sathya BETTADAPURA, and Jeffrey KIMMEL, titled "Automatic Verifi- 
cation of Scheduling Domain Consistency", attorney docket number 1 03 . 1 066.0 1 . 

Each of these application(s) is hereby incorporated by reference as if folly 
set forth herein. They are collectively referred to as the "incorporated disclosures." 

Lexicography 

The following terms refer or relate to aspects of the invention as described 
below. The descriptions of general meanings of these terms are not intended to be limit- 
ing, only illustrative. 

• concurrent — in general, overlapping in time. Concurrent access to a resource in- 
cludes all forms of access in which the data structure or hardware device might be 
found in an inconsistent state as a consequence of access by more than one task. 

• execute or run — in general, when the scheduler grants control of a processor to a 
task to perform the instructions which makeup its program. 
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• executable or runnable - when a task is ready to execute, but has not yet been 
granted control of a processor (due to priority or scheduling domain restrictions). 

• MP-safe — in general, suitable for operation in a multiprocessor environment 
without corruption or loss of data, or improper contention for resources. Tech- 
niques that serve to prevent improper concurrent access render their subject MP- 
safe. 

• resource — in general, any software or hardware object for which concurrent ac- 
cess would need some form of synchronization for safe operation in a multiproces- 
sor system. Examples of resources include, but are not limited to: data structures, 
disk drives, modems, NIC cards, NVRAM, PCI devices, sound cards, and other 
peripherals. 

• schedule — in general, to select a task for execution or running by a processor. 
There is no particular requirement that scheduling involves either (a) operation for 
a selected period of time, or (b) operation in the future. A task is "scheduled" when 
it is made runnable. 

• scheduling domain — in general, a set of tasks and resources selected by design- 
ers or program coders for operation in a multiprocessor system, where it's under- 
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stood that only one task in the set is allowed to ran and access the resources at any 
given time. Multiple tasks from different scheduling domains can run concurrently. 

• symmetric scheduling — in general, scheduling tasks on each processor inde- 
pendent of the identity of the processor. In the invention, the method of scheduling 
performed on each processor is substantially the same, and no tasks are bound to or 
special to any particular processor. Thus, a task is just as likely to run on one proc- 
essor as it is on any other. 

• synchronize — in general, to protect a resource from improper concurrent access. 
Known synchronization techniques include explicit synchronization, in which de- 
signers or program coders include an explicit synchronization mechanism such as a 
lock or semaphore. The invention provides implicit synchronization, in which re- 
sources are synchronized by operation of the scheduling domain restrictions de- 
scribed herein. 

• task — in general, any schedulable entity. A task might include a single process, a 
single thread, or some other individually schedulable entity. 

As noted above, these descriptions of general meanings of these terms are 
not intended to be limiting, only illustrative. Other and further applications of the inven- 
tion, including extensions of these terms and concepts, would be clear to those of ordinary 
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1 skill in the art after perusing this application. These other and further applications are part 

2 of the scope and spirit of the invention, and would be clear to those of ordinary skill in the 

3 art, without further invention or undue experimentation. 

4 

5 System Elements 

6 

7 Figure 1 shows a conceptual diagram of a multiprocessor system including 

8 a set of migrating scheduling domains. 

9 

yio A system 100 includes a plurality of processors 110 and a shared memory 

B 1 12 °- 

-13 Each processor 110 has access to the shared memory 120 and to both (ex- 

Z . (1 

J}4 ecutable) tasks 121 and resources 122 therein. The shared memory includes the tasks 121 

Sfe and the resources 1 12 to be used by those tasks. Tasks 121 and resources 122 are each as- 

16 sociated with a single scheduling domain 123 (with exceptions as described below). Thus, 

17 for example, in a preferred embodiment there are three scheduling domains 123 called 

18 "Network", "Storage", and "Filesystem", each of which has associated therewith one or 

19 more tasks 121 and one or more resources 122. Each processor 110 schedules only those 

20 tasks 121 within scheduling domains 123 not already in use by other processors 110, so 

21 that each scheduling domain 123 is associated with only one processor 1 10 at a time. At 
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1 each moment, each processor 110 executes a task 121 associated with only a single one of 

2 the scheduling domains 123. 

3 

4 As described herein, scheduling domains 123 are not bound or fixed to any 

5 particular processor 110 forever, or even for any specific time period. Each processor 1 10 

6 schedules its tasks 121 independently of which processor 110 is performing the schedul- 

7 ing (that is, scheduling is symmetric with regard to processors 1 1 0), so it is possible for 

8 any scheduling domain 123 to be associated to any processor 110 from time to time, sub- 

9 ject only to the restriction, as further described below, that each scheduling domain 123 
y|o can be associated with only one processor 1 10 at a time. Since tasks 121 within a sched- 
y|i uling domain 123 can be executed at one time by one processor 110 and at another time 

« ^ 

1/12 by a different processor 110, scheduling domains 123 are said to be able to "migrate" 

* 13 from one processor 1 1 0 to another. 

■diva 

4l4 

fi 5 As described herein, resources 1 22 can include data structures, devices, and 

16 other objects for which concurrent access by more than one task 121 would need some 

17 form of synchronization for safe operation in a multiprocessor system. Although the de- 

18 scription herein primarily refers to resources 122 as data structures, there is no particular 

1 9 limitation of resources 1 22 to only data structures in a general form of the invention. 

20 
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1 Scheduling Domain Structure 
2 

3 Figure 2 shows a diagram of scheduling domains, the tasks and resources 

4 that reside in them, and the resource access methods. 

5 

6 The system 100 includes a plurality of scheduling domains 123, each in- 

7 eluding a set of possibly runnable tasks 121 and a set of resources 122. Resource access is 

8 possible via one or more of, or some combination of, the following: 

9 

#) • A task 121 can directly read data from, or write data to, a resource 122 in the same 

— ~ " 

L3h scheduling domain 123. 

*> i» 

= 13 When a task 121 reads data from, or writes data to, a resource 122 in the same 

34 scheduling domain 123, that interface between the task 121 and the resource 122 is 

ftb considered MP-safe (safe for multiprocessor operation), because resources 122 are 

16 implicitly synchronized by the system 100, as described below. 

17 

18 • A task 121 can directly read data from, or write data to, a resource 122 in a differ- 

19 ent scheduling domain 123 (or a resource 122 not assigned to any scheduling do- 

20 main 123). 

21 
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When a task 121 directly reads data from, or writes data to, a resource 122 in a dif- 
ferent scheduling domain 123 (or a resource 122 not assigned to any scheduling 
domain 123), that interface between the task 121 and the resource 122 is not con- 
sidered MP-safe ? unless the resource 122 is explicitly synchronized using an ex- 
plicit synchronization technique, as described below. 

• A task 121 can access data from another domain by proxy. That is, a message can 
be sent to a server task, which resides in the target domain. The server task ac- 
cesses the data directly (on behalf of it's client), and then passes the results back to 
the client task. 

• A task 121 can switch from a first scheduling domain 123 to a second scheduling 
domain 123. 

Figure 4 shows a conceptual diagram of scheduling in which tasks can switch 
scheduling domains. 

Each task 1 2 1 includes an associated label indicating to which scheduling domain 
123 the task 121 belongs. The task's application code 311 in the application layer 
310 performs a "relabel-this-task" system call 313 to scheduler code 322 in the 
kernel layer 320. 
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1 Similar to the description with regard to figure 3, the application code 311 makes 

2 the system call 312 to transfer control to the scheduler code 322. The scheduler 

3 code 322 alters the task's associated label, thus changing the task's scheduling 

4 domain 123, and places the transferor task 121 on the runnable queue 550 (in its 

5 proper location in response to its relative priority). The relabel code 322 transfers 

6 control to the scheduler code 32 1, which (as described with regard to figure 3) se- 

7 lects a next task 121 to execute, performs a context switch into that next task 121, 

8 and "returns" to a second set of application code 3 12 in that next task 12L 

9 

yap Unlike the operation of scheduler code 321 with regard to figure 3, in this figure 

yfi the scheduler code 321 ''returns" back to the same application code 311 in the 

" Jj2 "next" task 121. As a consequence, the task 121 continues to execute the same ap- 

b13 plication code 311, but with a new label indicating that the task 121 belongs to a 

1*4 different scheduling domain 123. 

35 

16 Since the scheduler refuses to schedule more than one task 121 from the originat- 

17 ing scheduling domain 123, if the task 121 attempts to switch domains into a 

18 scheduling domain 123 that is already in use (that is, there is already a task exe- 

19 cuting that belongs to the target scheduling domain 123), the task 121 will not be 

20 selected for execution, and will block until at least that scheduling domain 1 23 is 

21 freed for use. Because the task 121 has been relabeled when the scheduler selects 

22 the next task 121, the scheduler is willing to schedule another task 121 from the 
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"old" scheduling domain 121. For example, if a task 121 in the "Network" sched- 
uling domain 123 switches into the "Storage" scheduling domain 123, the "Net- 
work" scheduling domain 123 is freed for use by other tasks 121 in that domain, 
while the "Storage" scheduling domain 123 is occupied by the switching task 121 
and not available for use by other tasks 121. 

• A task 121 can release control of its processor allowing another task to run, which 
is either in the same or a different scheduling domain 123. 

When a task 121 releases its processor, the task 121 invokes the scheduler (that is, 
the part of the operating system that performs task scheduling), which determines 
which task 121 is next to run on the same processor 110. The scheduler chooses a 
next task 121 to run independent of which processor 110 it is invoked on, but in 
response to which scheduling domains are being executed on other processors 110. 
The scheduler does not select any task 1 2 1 that would cause a scheduling domain 
123 to be executed on more than one processor 1 10 at once. 

Transfer of control from one task 121 to another can result in a different task from 
the same domain being run immediately. However, it is possible that the scheduler 
will select a different, higher priority, task 1 2 1 from a different scheduling domain 
so that the transfer of control will result in that higher priority task 121 being run 
instead. 
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1 Symmetric Scheduling 

2 

3 Figure 3 shows a conceptual diagram of symmetric scheduling using a set 

4 of migrating scheduling domains. 

5 

6 Each task 121 is able to invoke a scheduler on the processor 1 10 on which it 

7 is executing. The scheduler is independent of which processor on which it is running (al- 

8 though in alternative embodiments, it may be that the scheduler takes into account which 

9 processor on which it is running, such as for load-balancing or cache affinity purposes, 
yfi) while still allowing scheduling domains 123 to migrate from one processor 110 to an- 

other). The figure shows a task 121 invoking the scheduler and causing a context switch 

IJ2 to a different task 121 in a different scheduling domain 123. 

-13 

:§4 The task 121 includes application code 311 in an application layer 310 of a 

3fe computing system on its processor 110. The application code 3 1 1 performs a system call 

16 that results in invoking the scheduler code 321 in a kernel layer 320 of the computing 

17 system. Application layers, kernel layers, and system calls are known in the art of operat- 

18 ing systems. 
19 

20 The application code 311 makes the system call 312 which transfers control 

21 to the scheduler code 321. The scheduler code 321 selects a next task 121 to execute, per- 

22 forms a context switch into that next task 121, and "returns" to a second set of application 
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1 code 312 in that next task 121. Unlike known schedulers, the scheduler code 321 selects 

2 only those tasks 121 capable of running without causing a scheduling domain 123 to be 

3 running on two different processors 110 concurrently. 



5 Method of Scheduling 



Figure 5 shows a process flow diagram of a symmetric scheduler using a set 



8 of migrating scheduling domains. 



A method 500 includes a set of flow points and process steps as described 



#i herein. 



=d3 At a flow point 510, application code 311 makes the system call 312 to in- 

;ff4 voke scheduler code 321, and the scheduler is entered. The scheduler uses a queue 550 of 
Sfe runnable tasks 12 1, each of which is labeled with a scheduling domain 123 associated 



16 with that task 121. The queue 550 includes a head 551 of the queue 550, which identifies 

1 7 a particular task 121. 

18 

19 At a step 511, the scheduler examines the queue 550 for a next runnable 

20 task 121. If there is no such task 121 (that is, the queue 550 is empty or has been com- 

21 pletely traced down), the scheduler goes into an idle mode and proceeds with the flow 

22 point 510. (Thus, the idle mode can be entered in one of two different ways: first, if there 
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1 is no next task 121 to run, that is, the runnable queue 550 is empty; second, if there is no 

2 task 121 on the runnable queue 550 capable of being run, due to scheduling domain 123 

3 restrictions.) If there is such a task 121, the scheduler proceeds with the next step. 



5 In alternative embodiments, the runnable queue 550 may be separated into a 

6 separate runnable queue per scheduling domain 123. This implementation may optimize 

7 (speed-up) the scheduler lookup and queue functions. 



8 



"S 



LIS 



16 



At a step 512, the scheduler examines the task 121 at the identified position 
in the queue 550, and determines which scheduling domain 123 the task 121 is associated 



r¥i with. 



3 -i3 At a step 513, the scheduler determines if that scheduling domain 123 is 

#4 available for scheduling. If so, the scheduler proceeds with the flow point 514. If not, the 
%5 scheduler proceeds with the step 511. 



17 At a step 514, the scheduler prepares to run the selected new task 121. The 

18 scheduler performs a context switch into the selected new task 121, and proceeds with the 

19 flow point 520. 

20 

21 At a flow point 520, processor 1 10 is running the selected task's application 

22 code 312. 



Express Mailing EL734816463US 



20 



103.1063.01 



1 Synchronization , Explicit or Implicit 



¥1 



Figure 6 shows a conceptual diagram of explicit synchronization and im- 



4 plicit synchronization. 



6 With explicit synchronization, a first task 121 and a second task 1 2 1 each 

7 attempt to access a shared resource 122, such as a data structure. To prevent improper 

8 concurrent access to the shared resource 122, each task 121 makes explicit calls 601 to a 

9 synchronization mechanism 602. The synchronization mechanism 602 might include a 
j|o lock, a semaphore, a monitor, or other methods known in the art of operating systems. 



With implicit synchronization, it is assumed by the application that the 
s-13 scheduler will provide the synchronization, by not running multiple tasks in the same do- 
: fik main concurrently. The first task 121 and the second task 121 each have an associated 



35 scheduling domain 123. If the two scheduling domains 123 are different, that indicates a 

16 designer's or program coder's decision that the two tasks 121 will not perform improper 

17 concurrent access to the shared resource 122 (in alternative embodiments, different 

18 scheduling domains 123 may indicate that if there is any improper concurrent access to 

19 the shared resource 122, no harm will come to the system 100). If the two scheduling do- 

20 mains 123 are the same, that indicates a designer's or program coder's decision that the 

21 two tasks 121 might perform improper concurrent access to the shared resource 122, thus 

22 that the two tasks 121 are not allowed to execute concurrently. 
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1 The scheduler prevents concurrent execution of the two tasks 121, and 

2 therefore prevents concurrent access to the shared resource 122, as a consequence of the 

3 steps 512 and 513 described above. Because the scheduler refuses to schedule two tasks 

4 121 for concurrent execution on different processors 110 when those two tasks 121 are 

5 associated with the same scheduling domain 123, the two tasks 121 are implicitly syn- 

6 chronized with regard to the shared resource 122. The resource 122 is therefore also asso- 

7 ciated with the same scheduling domain 1 23 . Lack of improper concurrent access to the 

8 resource 122 is therefore an emergent consequence of the scheduler's behavior in refus- 

9 ing to concurrently schedule tasks 121 from the same scheduling domain 123. 

J ; ¥i Tasks and Resources Not In Any Domain 

3=13 Tasks 121 or resources 122 can also be declared by the designer or program 

if 4 coder to not be in any scheduling domain 123. 

"I v 

16 If a task 121 is not in any domain, the designer or program coder thus indi- 

17 cates that the task 121 is MP-safe, that is, that running the task 121 will not result in any 

18 improper concurrent access to any resources 122. A task 121 can be declared MP-safe for 

19 one or more of the following reasons: 

20 

21 • The task 121 does not use any resources 122, or all of its resources are internal to 

22 the task 121. 
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For example, the task 1 2 1 might perform only operations on data structures given 
to it by a requesting task 121 , or the task 121 might use only its own internal data 
structures (dynamically allocated by the operating system, not static). One example 
could be when the task 121 performs an operation that does not involve use of re- 
sources 122, such as a pure calculation. 

• The task 121 already uses explicit synchronization methods to assure it is MP-safe. 

For example, the task 121 might use locks or semaphores to maintain all its data 
structures or other resources 122 MP-safe. A first example could be when the task 
121 is part of the kernel and was already designed to be MP-safe. A second exam- 
ple could be when the task 121 is designed or coded explicitly for safe use in a 
multiprocessor system. 

• Designers or program coders decide that the task's nature is such that it will not 
involve improper concurrent access. 

For example, although the task 121 is not formally MP-safe, and does use re- 
sources 122 that are not formally MP-safe, designers or program coders knowing 
the nature of the system 1 00 might have decided that the task 1 2 1 will not ever be 
run, in normal operation, in an unsafe circumstance. 
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• Designers or program coders decide that the task's nature is such that, even if it 
does involve improper concurrent access, that improper concurrent access would 
be harmless. 

For example, although the task 121 might be able to perform improper concurrent 
access, designers or program coders knowing the nature of the system 100 might 
have decided that in that event, the resources 122 that are improperly accessed do 
not need to be maintained consistent, or that there are adequate recovery proce- 
dures from that event. 

If a resource 122 is not in any domain, the designer or program coder thus 
indicates that the resource 122 is MP-safe, that is, that using the resource 122 will not re- 
sult in any improper concurrent access. A resource 122, and any library code which may 
maintain the resource, can be declared MP-safe for one or more of the following reasons: 

• The resource 1 22 already uses explicit synchronization methods to assure it is MP- 
safe. 

For example, the resource 122 might require locks or semaphores to be accessed. 

• Designers or program coders decide that the resource's nature is such that it will 
not involve improper concurrent access. 
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For example, although the resource 122 is not formally MP-safe, and might be 
used concurrently by multiple tasks 121, designers or program coders knowing the 
nature of the system 100 might have decided that the task 121 will not ever be run, 
in normal operation, in an unsafe circumstance. 

• Designers or program coders decide that the resource's nature is such that, even if 
it does involve improper concurrent access, that improper concurrent access would 
be harmless. 

For example, although the resource 122 might be able to be concurrently accessed, 
designers or program coders knowing the nature of the system 100 might have de- 
cided that in that event, the resource 122 does not need to be maintained consis- 
tent, or that there are adequate recovery procedures from that event. 

Tasks and Resources In More Than One Domain 

Tasks 121 or resources 122 can also be declared by the designer or program 
coder to be in more than one scheduling domain 1 23 . 

In a preferred embodiment, a task 121 can perform a system call to "grab" a 
second scheduling domain 123 for a period of time. In this case, both the task's first 
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1 scheduling domain 123 and the task's second scheduling domain 123 are not free for con- 



2 current use by other tasks 121. 



3 



4 In a preferred embodiment, designers or program coders can declare a re- 



5 source 122 to be "part of both a first scheduling domain 123 and a second scheduling 



6 domain 123. 



7 



8 Preemptive Multitasking 



9 



r — j; 

#) A preferred embodiment described herein uses non-preemptive multitask- 

L¥i ing; that is, a task 121 only blocks if it makes the appropriate system call to the scheduler 

■i i- & 

sfc to block itself and allow another task 1 2 1 to run. 

^13 

34 In a system 100 using preemptive multitasking, a task 121 can be preempted 

rt5 "against its will," that is, without the task 121 necessarily having a chance to assure that 

16 all its resources 122 or other data structures are in order for another task 121 to run. In 



17 this case, the task 121 might have left one or more resources 122 in a state that disallows 



18 other tasks 121 from the same scheduling domain 123 from accessing those resources 



19 122. Accordingly, in alternative embodiments using preemptive multitasking, when a task 



20 1 2 1 is preempted, it becomes the only next task 121 from its scheduling domain 1 23 able 



21 to next run. Thus, the scheduler will select, from each scheduling domain 123, the pre- 



22 empted task 121 over all other tasks 121 in that scheduling domain 123. 
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1 Generality of the Invention 



3 The invention has general applicability to applications of any kind execut- 

4 ing in an MP system in which the scheduler provides implicit synchronization using do- 

5 mains. Although a preferred embodiment is described with regard to a file server, there is 

6 no particular limitation of the invention to file servers or similar devices. Techniques used 

7 by a preferred embodiment of the invention for implicit synchronization, domain migra- 

8 tion, domain switching, and the like can be used in contexts other than the specific appli- 

9 cations disclosed herein. 



si 



WO 



L11 The invention is generally applicable to all applications capable of being 

i; n i: 

^fe run in a multiprocessor system, and to any multiprocessor system in which the scheduler 

-13 (or equivalent part of an operating system) can be used to enforce implicit synchroniza- 



-ts; 



354 tion as described herein. 



16 Other and further applications of the invention in its most general form 

17 would be clear to those skilled in the art after perusal of this application. The invention 

18 would be usable for such other and further applications without undue experimentation or 

19 further invention. 

20 
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1 Although preferred embodiments are disclosed herein, many variations are 

2 possible which remain within the concept, scope and spirit of the invention; these varia- 

3 tions would be clear to those skilled in the art after perusal of this application. 
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